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ABSTRACT 

In the run up to routine observations with the upcoming generation of radio facilities, the 
nature of sub-mJy radio population has been hotly debated. Here, we describe multi-frequency 
data designed to probe the emission mechanism that dominates in these faint radio sources. 
Our analysis is based on observations of the Lockman Hole using the Giant Metre-wave Radio 
Telescope (GMRT) - the deepest 610-MHz imaging yet reported - together with 1.4-GHz 
imaging from the Very Large Array (VLA), well matched in resolution and sensitivity to the 
GMRT data (creioMHz ^ 15 /iJy beam^^, cri.4GHz ^ 6 /iJybeam^^, FWHM ~ Sarcsec). The 
GMRT and VLA data are cross-matched to obtain the radio spectral indices for the faint radio 
emitters. Statistical analyses show no clear evolution for the median spectral index, ai^ion^^ 
(where Si, oc z^"), as a function of flux density, a^^^^l^^ is found to be approximately —0.6 
to —0.7, based on an almost unbiased 10-cr criterion, down to a flux level of S'i.4ghz ^ 
100 /iJy. The fraction of inverted spectrum sources (ai^4GH^^ > 0) is less than 10 per cent. 
The results suggest that the most prevalent emission mechanism in the sub-mJy regime is 
optically-thin synchrotron, ruling out a dominant flat spectrum or ultra-steep spectrum radio 
population. The spectral index distribution has a significant scatter, Aa k, 0.4 — 0.5, which 
suggests a mixture of different populations at all flux levels. Spectroscopic classification of 
radio sources with X-ray emission has allowed us to estimate that the fraction of radio-quiet 
AGN at 30/iJy < S'i.4ghz < 300 ^Jy is roughly 25 ± 10 per cent, suggesting that star- 
forming galaxies dominate the sub-mJy regime. 
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1 INTRODUCTION 

In early studies, radio astronomy was limited to bright sources 



associated with rare luminous (I/i.4ghz 



10^ 



' WHz"^) ra- 



dio galaxies and quasars (QSOs). Galaxies with nuclear activity 
are usually characterised by powerful radio lobes, which are evi- 
dence of interactions between highly collimated relativistic flows ~ 
coming from the nuclear activity - and the interstellar/intergalactic 
m edium. These magnificen t radio-loud structures were classified 
bv lFanaroff & RilevI 7T974h depending on their shape (FR I and II 
classes), and optical identifications showed that these active galac- 
tic nuclei (AG N) are usually hosted by massive elliptical galax- 
ies jMatthews e t al. 1964). It was not until the 1980s that radio 
sour ce counts at the sub-mJy level revealed a new r adio popula- 
tion ^Windhorst eFZIl 19851 : [Mitchell & Condonll 19851) . The nature 
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of the faint radio sources which dominate the number counts below 
~ 1 miy is controversial. Various studies (e.g. lSimpson et alj2006l : 
ISevmour et aLlbOOSlJSmolcic et ^|2008|) have identified this pop- 
ulation with star-forming galaxies (starbursts, spirals or irregulars) 
and radio-quiet AGN (faint FRi, Seyfert galaxies). 

The fractions of AGN and star-forming galaxies that con- 
tribute to the sub-mly radio regime is still hotly debated. Many 
different approaches have been taken to disentangle t hese two pop- 
ulatio n s: using fa r -infra red (far-IR)/radio flux ratios (Ponl ev et al.l 
l2005l : llbar et al.l l2008h : tack ling their radio b rightn ess tem- 
peratu res and luminosities jWrobel et alj l2005i: iGarrett et al.l 
I2OO5I): resolving thei r radio morphologies iMuxlow et alj l2005l : 
Bigg s & Ivisonll2008l): identify ing optical host galaxies via mor- 
phology (Padovani et al. '2007*), or spectroscopy (Gruppioni et ajj 
l1999; Barger et al. 2 007), or their locus in colour-colour di agrams 
Iciliegi et al.l boosl) : via X-ray identifications (Simp son et al.l 
I2OO6I) : or using their radio spectral indices dRichard j I2OO0I : 
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IClemens et ai]|2008h . These approaches tend to yield substantially 
different results. 

In terms of spectral indices, a, star-forming galaxies are usu- 
ally considered to have a mean spectral index between —0.8 and 
—0.7 (where S oc v"), with a relatively small dispersion, 0.24 
jCond on 1992). A sample of z < 0.5 FR I & II sources have 
also been found to have similar spectral indexes (between 178 MHz 
and 750 MHz), with mean and s catter of a = 0.7 4 ± 0.19 and 
a = 0.79 ± 0.14, respectively jLaing et"al]|l983l) . This implies 
that studies based on the radio spectral index have large diffi- 
culties disentangling star-forming from steep-spectrum FR-AGN 
populations. Nevertheless, the ra dio spectral index is sens itive to 
core-dominated radio-q uiet AGN telundell & Kuncic'2007|), GHz- 
peaked sources (G PS; lOopal-K rishna et al. 1983; 0'De3 ll998l ; 
Snellen et al. 2000) an d the ultra-steep spect r um sources (USS ; 
Rottgering et al.. .19941 ; [Chambers et atl 1 19961 ; IJarvis et all 1200 ll) 
usually found at high redshift. 

Recent studies, combining 610-MHz and 1.4-GHz data, have 
found evidence for flatter spectral indices (Bondietal. 2007; 
iGam et aLll2008al) and larger d ispersions at sub-mjy radio fluxes 
(e.g. iMagliocchetti et al.l |2008|) , suggesting that core-dominated 
radio-quiet AGN are playing a key role in the sub-mJy radio popu- 
lation. 

In this paper, we present two very deep radio images centred 
on the Lockman Hole (LH): the deepest to date at 610 MHz (ct ~ 
15 //Jy beam^^) obtained using the Giant Metrewave Radio Tele- 
scope (GMRT), and a deep 1.4-GHz image (a ~ 6/iJy beam~^) 
obtained using the Very Large Array (VLA). At these long wave- 
lengths the dominant powering process is synchrotron radiation. 
We merge the two datasets to characterise the spectral index of the 
/ijy radio population as a function of flux density, thereby prob- 
ing the physical mechanisms that dominate in this enigmatic radio 
population: optically thin (steep spectrum) or self-absorbed (hard 
spectrum) synchrotron emission. Our deep, well-matched observa- 
tions - about three times deeper than previous data - mitigate the 
well-known bias towards the detection of steeper spectra at longer 
wavelengths, or flatter spectra at shorter wavelengths. This work 
provides a parameterisation of the radio spectral energy distribu- 
tion (SED) that can be used to estimate more precise fc-corrections 
for the observed radio emitters. 

Throughout this paper we assume a Universe with Jim = 
0.27, Q.A = 0.73 andi/o = 71kms-iMpc"^ 

2 RADIO OBSERVATIONS 
2.1 GMRT 

During six 12-hr sessions in 2006 February and July we obtained 
data for three pointings (primary beam FWHM ~ 43 arcmin) in the 
Lockman Hole (see Table[Tll, separated by 11 arcmin (LOCKMAN- 
E, LOCK-3, LHEX4), typically with 28 of the 30 antennas that com- 
prise the GMRtQ near Pune, India. The total integration time in 
each field, after overheads, was 16hr. We recorded 128 channels 
=I6MHz each) centred on 610MHz every 16s integration 
in the lower and upper sidebands (LSB and USB centred at 602 
and 618 MHz, respectively) in each of two polarisations. Integra- 
tions of 40-min duration were interspersed with 5-min scans of 
the nearby calibrator, 1035+564 (S'eioMHz ~2Jy), with scans of 

^ GMRT is run by the National Centre for Radio Astrophysics of the Tata 
Institute of Fundamental Research. 



3C48, 3C 147 and 3C286 (SeioMHz = 29.4, 38.3 and 21.1 Jy, re- 
spectively) for flux and bandpass calibration. 

Calibration initially followed standard recipes within AIVS, 
using FITLD, INDXR and SETJY. However, because of concerns that 
some baselines were picking up signal from local power lines and 
that 1035-1-564 might be too weak to act as a reliable secondary flux 
calibration source, a raft of new measures were introduced to avoid 
detrimental effects on the resultant images. 

For each session, the bright source least affected by radio- 
frequency interference (RFI) and with the fewest malfunctioning 
antennas was SPLIT and chosen to be the primary flux density and 
bandpass calibrator. After intensive manual flagging of RFI us- 
ing SPFLG and TVFLG, the chosen calibrator was self-calibrated in 
phase. Antenna-based bandpass solutions were determined, copied 
to the full dataset and used to determine new gain solutions for 
the primary calibrator. The gain and bandpass solutions were then 
applied to the entire dataset with no time-dependent corrections. 
The secondary calibrator was employed to identify problems with 
the antennas rather than to track changes in gain, although a more 
conventional approach was used to generate images with good po- 
sitional information for use in initial phase self-calibration. 

Next, calibrated data were processed with the FLGIT RFI- 
rejection algorithm. Each 128-channel integration was split into a 
series of seven 15-channel pieces, discarding the first 10 and last 
13 channels, and points deviating from linear fits to each piece by 
more than 5 a were rejected. Data brighter than 1.5 x the brightest 
calibrator were also rejected, leaving around 70 to 95 per cent of 
the original data intact, depending on the severity of the RFI. 

The resulting data, now somewhat cleaner, were averaged 
down to yield 15 channels in each polarisation for each ses- 
sion, pointing and sideband: a total of 12 dual-polarisation, single- 
sideband, single-source datasets. 

Before imaging, a specially modified version of UVAVG (now 
standard) was employed to determine and subtract the average 
value for each baseline and channel throughout the entire session 
(hence the need for time-independent calibration in the preceding 
steps). 

Imaging each of these datasets entailed the creation of a mo- 
saic of 37 facets, each 512^ pixels (1.25^-arcsec^ per pixel), to 
cover the primary beam. A further 6-12 bright sources outside 
these central regions, identified in heavily tapered maps, were also 
imaged. Our aim was to obtain the best possible model of the sky. 
CLEAN boxes were placed tightly around all radio sources for use 
in self-calibration, first in phase alone (SOLMODE = 'a!p'), then in 
amplitude and phase (SOLMODE = 'A&P'), with a solution interval 
of 2 min, staggered by 1 min. The uv data were weighted using RO- 
BUST = —0.5, UVRANGE = 0.8, 100 kA and uvtaper = 28, 28 kA 
with UVBOX = 5. 

After CLEAN components were subtracted from the uv data, 
more manual flagging was applied, as well as another pass through 
the UVAVG task and a clip at the 350-mJy level using UVFLG 
(now CLIP). CLEAN components were re-introduced (UVSUB, 
FACTOR=— 1), then data with common sidebands from February 
and July were combined using DBCON to reduce the variation in 
beam size and shape amongst the datasets. 

The final six mosaics (see Table [T), two for each pointing 
(LSB and USB), were then convolved to a common beam size 
(7.1 arcsec x 6.5 arcsec, with the major axis at position angle 70°) 
The noise (see Table [T]! from each image is estimated and intro- 
duced in their headers using IMEAN, before all are knitted together 
using FLATN. An appropriate correction was made for the attenua- 
tion of the primary beam, with data rejected at radii beyond where 



© 0000 RAS, MNRAS 000, 000-000 



The nature of the sub-mJy radio population 3 



57 - 



100 



15D 



8 



-1= 

z 
□ 
u 

lU 

a 



21 - * 
20 - 
l9 - 

ia - 

17 - 



4 



10 53 DO 



52 45 



■3& 



15 00 51 45 30 

RIGHT ASCENSION <J3000) 



15 



00 



50 45 



S" 

o 

3- 
I- 

4 



(J 

LU 

Q 



57 aa I- 

21 h ' 

EO 

19 

IB - 
17 - 
16 - 



100 

— 1— 




IO53O0 92 49 30 13 00-9149 30 

Rl QHT ASCEM5I0M 1 J»») 



19 



CO 



90 49 



Figure 1. Top: A small region (18.7x7.5 arcmin'^) near the centre of the 1.4-GHz mosaic. The linear grey-scale runs from to 150/xJy beam~^. The 
synthesised beam is 4.3 X 4.2 arcsec^ (FWHM) with a position angle of 77°. Bottom: The deepest 610-MHz image obtained to date, covering the same region 
and with the same linear grey-scale as the VLA image on top. The brightness scale of the image has been multiplied by 0.56 (= [1400/610]", where 
a = —0.7) in order to visually compare it with the 1.4-GHz map. The restoring beam measures 7.1 X 6.5 arcsec^ (FWHM) with a position angle of 70°. 



the gain drops to 30 per cent, i.e. at a distance of ^ 28 arcmin from 
the pointing centre. This final image has a noise level in the central 
~ 100 arcmin^ of 14.7 /iJybeam^^, the deepest map so far reported 
at 610 MHz, despite the modest integration time (16 hr on sky for 
each pointing). 

Based on the brightest pixel in the mosaic, 0.032 Jybeam"^, 
we reach a dynamic range between ~ 2, 200 : 1 and 900 : 1 
considering the central and local r.m.s. noise, respectively. Our im- 
ages, however, m ay be long way from being limited by dynamic 
range. iGarn et al] ( l200 8b) reaches ~9,000: 1 and it appears likely 
that we could push down to the confusion limit in the observations 
discussed in this paper. 



2.2 VLA 

New and archival data were obtained in the same three positions us- 
ing the National Radio Astronomy Observatory 's (NRAC0) VLA, 
largely in its B configuration. At 1400 MHz this yielded images 
(primary beam FWHM ^ 32 arcmin) well matched to the resolu- 
tion of GMRT. We tapered our A- an d B -configuration data in the 
LOCKMAN-E field ( ilvison et alj|2002l) to yield images with a near- 
circular 4.0-arcsec synthesised beam. Using the same techniques 
outlined earlier, we then combined this central field with images 

^ NRAO is operated by Associated Universities Inc., under a cooperative 
agreement with the National Science Foundation. 
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GMRT pointings 



Bandwidth Smearinc 



Field 


R.A. {hr:iiiin:sec 


Dec. (deg:min:.sec) 


r.m.s. (/ijybeam ^) 


LHEX-4 


10:52:56.0 


-1-57:29:06.0 


33.7 (USB) 








29.6 (LSB) 


LOCKMAN-E 


10:51:59.0 


-1-57:21:28.2 


26.2 (USB) 








26.0 (LSB) 


LOCK 3 


10:51:02.0 


-1-57:13:50.4 


24.5 (USB) 








23.7 (LSB) 






VLA pointing.s 




Field 


R.A. {hr:niin:sec 


Dec. (deg:min:sec) 


r.m.s. beam ^ ) 


LHEX-4 


10:52:56.0 


-1-57:29:06.0 


7.2 


LOCKMAN-E 


10:52:08.8 


-1-57:21:33.8 


7.6 


LOCK-3 


10:51:02.0 


-1-57:13:50.4 


11.0 



Table 1. The GMRT and VLA pointings used in tliis work. USB and LSB 
correspond to the upper and lower side bands, respectively. 



made using pure B-configuration data in the two other pointings: 
the designated LOCK-3, 11 arcmin to th e south west, for which 
we obtained 17 hr of data in 2005 March dlvison et al.ll2007h : plus 
archival data for LHEX4, 1 1 arcmin to the north east of LOCKMAN- 
E, which comprises 31hr of integration (see Table [TJ. The final 
mosaic-image has an r.m.s. in the central 100 arcmin of 6.0 ^Jy 
beam~^, and a resolution of 4.3 x 4.2 arcsec^ (FWHM) at position 
angle of 77°. 



3000 - 




Figure 2. Contour plot for the expected point-source bandwidth-smearing 
in the 1.4-GHz mosaic (see Eq.|2). Contours ai'e plotted at: 0.85, 0.87, 0.90, 
0.92, 0.95 and 0.97. Note that the smallest corrections do not occur exclu- 
sively at the image centre. 



3.2 Instrumental effects 

Four important instrumental effects must be taken into considera- 
tion. 



3 CATALOGUES 

Initially, we extracted sources down to a peak-to-local-noise ra- 
tio = 3 (hereafter PNR), in order to identify all possible faint and/or 
extended emission. The sources included in the final catalogues 
were selected to have PNR > 5. 



3.1 Initial source extraction 

Sources were extracted from the final FLATNed mosaics (images of 
Stokes 7), down to a 3-cr (thereafter a refers to the local noise) peak 
level, using the AIVS routine, SAD (CPARM = 500, 100, 50, 10, 6, 
4, 3, 2.5; DPARM(1)=3; DPARM(2)= [15 /iJy for GMRT; 7 /iJy for 
VLA]; ICUT = 0.1; GAIN = 1). To provide a rehable noise-based 
extraction criterion, a noise map was generated from the Stokes / 
image using RMSD (IMSIZE=71,-1; OPTYPE='HIST'). This noise 
map was introduced as a secondary image for SAD (DPARM(9) = 
3), which ensured reliable source extraction around bright sources 
and near the map edges (see The increasing uncertainties 

in the SAD-Gaussian fits toward faint PNRs may result in sources 
having smaller areas than the beamsize (see the smallest CPARM 
parameter). We use a threshold in peak flux density instead of 
a threshold in integrated flux because peak flux density is a lin- 
early independent variable in the SAD fitting procedure (actually, 
in JMFIT), whereas integrated flux density is a product of peak flux 
and source area. This translates into cleaner and more complete 
selection criteria. Note, however, that evidences for an anticorre- 
lation b e tween peak fluxes and source area have been fou nd by 
ICondonI ^199% in images with uncorrelated noise. ICondonI shows 
that this effect disappear when the noise is correlated at similar 
scales than the synthesised beam. Given by the convolution to a 
common beamsize made before using FLATN, we expect no anti- 
correlation in the Gaussian fit parameters. 



3.2.1 Bandwidth smearing 

Bandwidth smearing is inevitable when using channels with finite 
bandwidth. This affects the peak flux densities of sources, but not 
their integrated fluxes. We have estimated this bias through knowl- 
edge of the distance between each source and the different phase 
centres and use a theoretical correction given by: 



To 



where P - 



(1) 



which is valid for point sources, and assumes a Gaussian bandpass 
and circular tapering in the uv plane iTavlor et alJI 19991) . I and Jo 
are the observed and intrinsic peak brightness, Av is the channel 
bandwidth, v is the frequency of the bandpass centre, ^syn is the 
synthesised beamwidth and 9 is the distance between the source 
and the phase centre. This estimation is in agreement with the JM- 
FIT correction when BWSMEAR is set to the channel bandwidth di- 
vided by the center frequency. 

In overlapping regions we have averaged the correction by us- 
ing a minimum variance weighting scheme (see Fig.|2]l, 



/o 



Ij/Ip 

r.m.f 



(2) 



which takes into account the noise for each pointing before primary 
beam correction. This bias was found to be important in the VLA 
image and not always negligible for the GMRT data. The maxi- 
mum correction factor was I /lo ~ 0.84 and I/Io ~ 0.94 for the 
VLA and GMRT mosaics, respectively. For example, Fig.|2]shows 
the bandwidth smearing expected in the VLA mosaic. Note this 
smearing can be used to correct the peak fluxes in the final cata- 
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logues (see column 5 in Table |3] and |4l(, but the selection criterion 
remained unaffected since it was based on the observed peak values 
(uncorrected surface brightness, in Jy beam^^). 

3.2.2 3-D smearing 

The general response equation for a two-element interferometer is 
usually approximated to a 2-D Fourier transform due to the sim- 
plicity of the inversion problem. Nevertheless, this is only valid for 
co-planar baselines (where the w axis lies in the dir ection of the 
celest ial pole) and for sufficiently small fields of view jTavlor et alj 
Il999h . At low frequencies, the primary beam is large so this ap- 
proximation becomes inefficient and a formal 3-D Fourier trans- 
form is required. A smearing effect is expected from the rotation 
of the tangential plane with respect to the celestial sphere when the 
separation between these two is large. 

We can estimate the effect of 3-D smearing in our images. For 
each VLA and GMRT pointing image, we have diameters of ~ 45 
and ~ 57 arcmin, respectively, spanned by seven facets. The maxi- 
mum separation between the tangent plane and the celestial sphere 
is given by A = 1 — cos 0, where (j) is half the subtended angle of 
each facet (~ 6.4 and ~ 8.1 arcmin, respectively). In our images the 
maximum separation between each tangent facet and the celestial 
sphere is ~0.09arcsec at 1.4GHz and ~0.14arcsec at 610MHz. 
In both cases, this separation is equivalent to approximately 2 per 
cent of the synthesised beamwidth. We consider this bias negligible 
and no correction to the observed data was made to correct for 3-D 
smearing. 

3.2.3 Time-delay smearing 

This smearing is due to the rotation of celestial sources with respect 
to the phase tracking centre during the integration time, i.e. longer 
sampling times lead to more blurred images. 

In our observations we used 16- and 5-s integration times to 
collect data from the GMRT and VLA, respectively . Considering 
theoretical assumptions (again, see lTavlor et al.ll999l) . we find that 
the maximum loss in peak intensity (expected at the edge of the 
field of view) is < 2 per cent for GMRT and < 0.3 per cent for 
the VLA. Consequently, we do not implement any correction to the 
data for time-delay smearing. 

3.2.4 Primary beam attenuation 

Primary beam attenuation is the intrinsic loss in gain as a func- 
tion of distance from the pointing centre due to the antenna re- 
sponse. VLA images were corrected using the default 10th order 
fit to the beam response at 1 .4 GHz, described in AIVS (EXPLAIN 
PBPARM). For the GMRT images we used the 8th order polynomial 
fit reported by N. G. Kantharifl. 

Based on a GMRT mosaic composed of 7 pointings, 
iGam et aL I l l2007h reported the primary beam centre was affected 
by an offset with respect to the phase centre - these two are usu- 
ally coincident. They revealed a systematic difference between the 
apparent brightness of sources observed by adjacent pointings, and 
solved this problem by using a common offset of ~ 2.5 arcmin for 
the primary beam corrections. 

In order to tackle the offset, first we checked that the images 
created by the upper and lower sidebands (IFs - LSB and USB) 

www .ncra.tifr.res. in/~ngk/primarybeam/beam . html 



from each of our three pointings are giving consistent results for 
the source estimations. We do not find deviations besides of typi- 
cal differences in flux calibration of < 5 per cent. Based on these 
results the IFs were combined in the image plane. 

To investigate this thorny issue we define - in overlapping re- 
gions - as the spherical angle formed by the source, the middle 
distance between two pointings and the northerly direction. Flux ra- 
tios are sensitive to pointing variations as a function of 0, as shown 
in Fig. [3] These diagrams can be used to prove if the primary beam 
is really offset with respect to the phase centre. 

In Fig.|3]we plot as a function of the gain factors (J(i) — 
I(2))/{I(i) + 1(2)) and (^(i) - 5'(2))/(S'(i) + S(2)), where / is 
the peak intensity value corrected for bandwidth smearing and S 
is the integrated flux density (the subindexes (ij and (2) show the 
pointings being compared). This estimate is based on single sources 
detected at PNR > 25, and shows no substantial evidence for a 
primary beam offset. In these plots, the signature expected for a 
primary beam offset is a sinusoidal data distribution. To measure 
this effect we use a sinusoidal fit. see dashed lines in Fig. [3] to 
provide the direction of the offset via the phase of the fit as well as 
the amplitude of the offset. 

Based on this method to tackle the pointing offset, we find 
that the use of peak flux values instead of integrated flux densities 
can result in an apparent pointing offset, mostly because of the 
fits obtained for the most widely separated pointings (3-1; bottom 
in Fig. [3j. Smearing effects may thus simulate the behaviour ex- 
pected for a primary beam offset when / values are used. Since we 
have accounted for bandwidth smearing, these fits (if robust) sug- 
gest smearing is more prevalent than expected. This could be due 
to inadequate bandpass calibration, but the cause is fundamentally 
unknown. Since integrated flux densities (S) are not affected by 
smearing effects, they should provide an unbiased estimation for a 
primary beam offset. We show in the right-hand panels of Fig. [3] 
that (f> does not show compelling signatures for primary beam off- 
sets. 

We ran simulations in order to test the reality of the small am- 
plitudes seen in the fits, applying different primary beam cor- 
rections in the directions indicated by the fits, with the idea of min- 
imising the fitted amplitudes. We found that the offsets required 
to remove the pointing offsets were ^ 1 arcmin, with different di- 
rections for all the pointings. This contradict s the sin gle offset of 
~ 2.5 arcmin, in a common direction, used bv lGam et al. (2007). 

Due to the lack of evidence for a significant and consistent 
pointing shift, we decided not to apply any primary beam offset to 
our data. 

3.3 Multiple sources 

The definition and identification of multiple systems is a common 
problem in radio astronomy, i.e. how many fitted Gaussian peaks in 
the image are related to a single galaxy? This is especially difficult 
for deep radio observations where extremely deep optical imaging 
is required to identify the host galaxy. We can look at the image to 
find obvious double-sided jets from bright, extended radio galaxies 
(FRil), but this becomes more difficult at faint flux levels for all 
angular scales, for obvious reasons. 

The classification of multiple systems is also highly 
resolution-dependent. A source adequately described by a single 
Gaussian at 610MHz may require more than one component in 
the higher resolution 1.4 GHz data, confusing catalogues, number 
counts, and the study of spectral indices. Later, we explicitly refer 
to 610-MHz- or 1.4-GHz-selected samples to avoid confusion. 
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Figure 3. These figures compare source estimates from different pointings in overlapping regions. Tlie y-axis sliows tlie variation in peak brightness corrected 
by bandwidth smearing (/ - left) and integrated flux densities (5 - right), for all sources detected with PNR > 25. Subindexes show the pointings being 
compared (see the bottom-right sketches). The x-axis is the angle 4> defined in spherical coordinates by the source position, the middle distance between the 
two pointings and the northerly direction. Dashed lines are sinusoidal fits that can provide the direction and amplitude of a primary beam offset with respect 
to the phase centre. 



In order to identify and classify the sources, we filtered the 
initial 3-a SAD catalogue, excluding all those fits with peak val- 
ues below 4x the local noise. Then we identified all those de- 
tections having close neighbours (with a PNR > 4) based on a 
simple criterion: if the distance between two Gaussians is lower 
than 1.2x the sum of their measured FWHMs in the direction de- 
fined by the angle they form in the sky (see Fig. |4ll, then these 
detections are excluded from the so-called "single-source" sample. 
We have thus applied a "friend-of-a-friend" technique, using an el- 
liptical (direction-dependent) search radius - an efficient classifi- 
cation method. All these sources have been treated independently 
in order to check their reliability. Only a small minority of them 
have been considered as single emitters with more than one Gaus- 
sian component. In Table |2] we describe the source cl assifications. 
This identification is not res t ricted in flux density (e .g. lCiliegi et al.l 
1 19991 : ISevmour et ^|2004| : iBiggs & IvisorJ l2006h because some 
sources display diffuse emission, or have fainter components su- 
perimposed. The classification is presented in Tables|3]and|4l and is 
recommended to bear in mind when cross-matching the catalogues. 

TVSTAT was used to determine the flux densities of extended 
sources (usually sources with more than three Gaussian compo- 
nents - see Table|2ll. This gives a more accurate estimate for com- 
plex systems than summing the various Gaussians. A final inspec- 
tion, by eye, checked the reliability of the components in extended 
sources (including distant radio lobes in some cases), sources show- 
ing side-lobe patterns and sources with diffuse emission not in- 
cluded by the Gaussian fits. For all extended sources, errors in peak 
and flux density are assumed to be 5 per cent of the value reported 
by TVSTAT. 

Two other important source parameters are the observed an- 



"Single sources" "Neighbours" 




Separation > D2 Separation < D.|+ D2 

> 1.2 X (FWHiMi+FWHiMa) < 1.2 X (FWHi«i+FWHi«2) 

Figure 4. A simple sketch for the selection criteria used to identity sources 
with close neighbours. If the source in the right were considered to be a 
double, then it would have an observed angular size given by 'separation + 
(Di + D2)/l .2' (where Dj are 1.2 times the FWHM in the direction to the 
neighbour source), and a position angle of deg. 



gular size and the orientation of the sources. We have measured the 
observed angular size as follows. For multiple systems, it corre- 
sponds to the separation of the furthest components, plus the mea- 
sured FWHM of the components in the direction defined by the an- 
gle they form on the sky (the 'position angle'). For single sources, 
it is equivalent to twice the maximum FWHM. The observed angu- 
lar size parameter and the phase angle are presented in Tables|3]and 

s 
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Table 3. A small sample of the sources found in the Lockman Hole field at 610 MHz using the GMRT. Source extraction is based on peak brightness > 5x 
the local rm.s. and integrated flux density > 3x the local rm.s. criteria. Columns: (1) standard source name; (2) right ascension; (3) declination. We note in 
j]3.5| there is an astrometiic offset between the VLA with respect to the GMRT sources, AR.A. = —0.60 it 0.03 and ADec. = 0.40 it 0.03 arcsec (mean 
offset in R.A. and Dec, respectively). For double and triple systems the position is given by the brightest component. For extended sources it is given by 
the most central component; (4) peak flux to local noise ratio; (5) bandwidth smearing correction ( i|3.2.U ; (6) observed maximum angular size ( ^331 . These 
values are not deconvolved source sizes but those fitted from the observed mosaic. For single sources, this value corresponds to twice the maximum FWHM. 
For multiple sources, it is given by the distance between the furthest components plus the FWHMs of each of them, in the direction they define in the sky; (7) 
The orientation angle (position angle) of the source with respect to North; (8) integrated flux density and estimated eiTors from SAD; (9) classification of the 
source. S = single, D = double (dl & d2 as components), T = triple (tl, t2 & t3 as components) and M = extended (Table|2). The upper index indicates 
sources extracted from the convolved, Areaboam x image ( i]3.4K (10) the radio spectral index between 610MHz and 1.4GHz, including the 68.3 per 
cent confidence range (based on the SAD flux density errors) and upper limits. '-' = outside cross-matching region, ' !' = unreliable spectral index; (11) Special 
flags in spectral indexes: 1 - upper limit, 2 - source which has split the counterpart's flux density in a relative contribution, 3 - estimation which has used the 
original SAD extraction before multiple classification, 4 - cross-matched sources separated by a distance > 3 arcsec, 5 — counterparts having a fitted area ratio 
twice bigger than the expected from point sources (A^o'u^^^ / ^smSSi' > 2 X ^bJ.am^^/^bcam^''-'' ^ ^ estimation based on a cross-match involving more 
than one counterpart, 7 - spectral index affected by close companion, and 8 - source affected by overlapping facets in the 3-D imaging approach. A complete 
version of this table is available as Supplementary Material through the on-line version of this paper. 



lAU n;unc 


Position at 610MHz (J2000) 


PNR 


BWSC 


Miix. size 


Position iingle 


Flu.Kdensily 


Class 


610MHz 
"l.4GHz 


a flags 




hr:min:sec 


deg:min:set; 
















(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


GMRTLH J 1 05 1 33 .4+5 7 1 45 9 


10:51:33.42 


+57:14:59.9 


7 


0.99 


14.3 


56 


105 ± 25 


S 


„ ..+0.38 
-0.&&_Q_33 




GMRTLHJ105 133.6+571308 
GMRTLHJ105 133.6+565039 


10:51:33.59 
10:51:33.64 


+57:13:08.8 
+56:50:39.1 


9 
5 


0.99 
0.96 


19.1 
17.8 


97 
89 


188± 29 
185± 59 


S 
S 


, f,.+0.26 
^-"^-0.24 




GMRTLHJ105 134.0+570552 


10:51:33.96 


+57:05:52.4 


8 


0.98 


14.6 


83 


116± 25 


S 


'■^ -0.32 
-0 97+0-26 
-0.26 




GMRTLHJ105 134.0+573729 


10:51:34.02 


+57:37:29.2 


11 


0.97 


15.8 


71 


242± 35 


S 




GMRTLHJ105 134.3+570922 


10:51:34.31 


+57:09:22.6 


7 


0.99 


13.9 


65 


177±45 


S 


si+0'37 
^■^^-0.29 




GMRTLHJ105 134.5+573359 


10:51:34.46 


+57:33:59.4 


10 


0.98 


13.6 


67 


195± 34 


S 


^ ^2+0.29 

-1 46+0-3^ 
^-^^^-0.36 
< -0.42 




GMRTLHJ105 134.5+573218 
GMRTLHJ105 134.6+574153 


10:51:34.53 
10:51:34.65 


+57:32:18.2 
+57:41:53.3 


8 
5 


0.98 
0.96 


15.4 
16.1 


178 



181± 36 
168± 46 


S 
S 


1 


GMRTLHJ105 134.8+571801 
GMRTLHJ 1 05 1 35 .0+5646 1 5 


10:51:34.81 
10:51:35.02 


+57:18:01.6 
+56:46:15.7 


7 
7 


0.99 
0.94 


17.5 
16.1 


65 
51 


134± 27 
325 ± 86 


S 
S 


, „^+0.38 
^■^^■^-0.38 




GMRTLHJiO5135.2+570133 
GMRTLHJ105135.3+570122 


10:51:35.19 
10:51:35.27 


+57:01:33.9 
+57:01:22.4 


10 

9 


0.98 
0.98 


17.4 
17.0 


15 
81 


234± 36 
225 ± 36 


SE 
SE 


-0.80_Q 3Q 
< -0.71 


1 


GMRTLHJi05135.6+572738 


10:51:35.60 


+57:27:38.9 


23 


0.99 


14.0 


64 


324± 24 


S 


-0 69+0-12 




GMRTLHJ 1 05 1 3 5 .6+57004 1 


10:51:35.61 


+57:00:41.5 


6 


0.98 


19.4 


176 


215± 49 


S 


< -0.45 


1 


GMRTLHJ 1 05 1 3 5 .8+57 1 344 


10:51:35.84 


+57:13:45.0 


6 


0.99 


22.6 


47 


145± 32 


S 


-1 26+0-43 
^-^^^-0.45 
35+0-50 

< -0.31 




GMRTLHJ105 135.9+573728 
GMRTLHJ105 136.0+573424 


10:51:35.92 
10:51:36.05 


+57:37:28.1 
+57:34:24.7 


6 
5 


0.97 
0.98 


13.5 
13.3 


7 
7 


92± 28 
73± 26 


S 
S 


1 


GMRTLHJ 1 05 1 36. 1 +5744 1 


10:51:36.06 


+57:44:10.0 


5 


0.97 


17.8 


37 


130± 38 


S 


< -0.28 


1 


GMRTLHJ105 136.2+572959 


10:51:36.20 


+57:29:59.1 


27 


0.99 


15.3 


72 


605 ± 36 


S 


„^+0.09 

-0 96+0-39 
^'-■^^-0-42 




GMRTLHJ105 136.3+57065 1 


10:51:36.28 


+57:06:51.2 


9 


0.98 


13.2 


41 


132± 25 


SD 




GMRTLHJ 1 04809. 1 +57041 4 


10:48:09.07 


+57:04:14.9 


5 


0.95 


30.0 


33 


330± 90 


S(X2) 







Table 4. The 1.4-GHz catalogue. See Table|3]for details. 



lAU name 
(1) 


Position at 1.4 GHz {J2000) 
hr: min : sec lieg : min: sec 
(2) (3) 


PNR 
(4) 


BWSC 
(5) 


Max. size 
(iiresec) 
(6) 


Position angle 
(lieg) 
(7) 


Flu\ density 

(MJy) 

(8) 


Class 
(.9) 


„610MHz 
°^1.4GHz 

(10) 


a Hags 
(11) 


VLALHJ105211.4+571551 


10:52: 


.11.44 


+57:15:51.7 


5 


0.95 


8.3 


178 


28± 9 


SD 


-0. 


.0 + 0.40 , 
^-^-0.50 ■ 


2 


VLALHJ10521 1,5+573953 


10:52: 


.11.48 


+57:39:53.2 


21 


0.90 


9.2 


46 


193± 17 


S 


0. 


, ,+0.28 
-0.24 




VLALHJ10521 1 .8+5735 1 


10:52: 


.11.82 


+57:35:10.2 


12 


0.94 


7.1 


37 


53± 8 


SD 


-1. 


..+0.35 , 
^^-0.32 - 


5 


VLALHJ10521 1.9+570540 


10:52: 


.11.86 


+57:05:40.5 


7 


0.91 


9.9 


161 


58± 16 


S 


-0 


^^ + 0.50 
■^^-0.49 




VLALHJ105212.0+572321 


10:52: 


: 12.04 


+57:23:21.6 


6 


0.97 


7.4 


146 


24± 7 


s 


-1 


4g + 0.54 
■^^-0.54 




VLALHJ 1 052 1 2. 1 +573454 


10:52: 


: 12.08 


+57:34:54.6 


9 


0.95 


8.0 


161 


46± 10 


s 




■ -0.98 


I 


VLALHJl 052 1 2. 1 +57262 1 


10:52: 


:12.11 


+57:26:21.4 


10 


0.97 


10.2 


136 


64± 10 


s 


-0 


oa + 0.52 
■■^»_0.41 




VLALHJl 052 1 2.2+57 1 525 


10:52: 


:12.16 


+57:15:25.4 


9 


0.94 


10.4 


73 


68± 12 


s 


-1 


-0.28 




VLALHJ105212.3+571549 


10:52: 


: 12.27 


+57:15:49.5 


19 


0.95 


8.0 


90 


84± 8 


SD 


-0. 


53+0-21 , 
^■^-0.20 ■ 


2 


VLALHJ 1052 12.5+572453 


10:52: 


: 12.49 


+57:24:53.1 


48 


0.97 


9.2 


97 


278± 10 


s 


-0 


R^ + 0.10 
■°^-0.09 




VLALHJ 1 052 1 2.6+57064 1 


10:52: 


: 12.63 


+57:06:41.3 


6 


0.92 


10.2 


32 


56± 17 


s 


-1 


ei+0.41 

■'^^-0.48 




VLALHJ 1052 13.3+572650 


10:52: 


: 13.29 


+57:26:50.6 


12 


0.97 


9.3 


156 


71± 9 


s 


> -0.30 


1 


VLALHJ 1 052 1 3.4+57 1 605 


10:52: 


: 13.38 


+57:16:05.3 


52 


0.95 


8.8 


59 


301 ± 10 


s 


-0 


„„+U.06 
■ ' -0.06 




VLALHJ 1 052 1 3.4+572600 


10:52: 


: 13.44 


+57:26:00.2 


10 


0.97 


7.8 


161 


45 ± 8 


s 


-0 


■*''-0.41 




VLALHJ 1 052 1 3.6+574436 


10:52: 


: 13.64 


+57:44:36.0 


6 


0.89 


7.8 


166 


64± 21 


s 


> -0.93 


1 


VLALHJ 1052 13.8+57 1338 


10:52: 


: 13.76 


+57:13:38.9 


13 


0.93 


9.2 


5 


89± 11 


SD 


-0. 


^^-0.23 ' 


2 


VLALHJ 1 052 1 3.9+573935 


10:52: 


: 13.89 


+57:39:35.9 


9 


0.90 


7.4 


124 


54± 12 


s 


> -0.95 


1 


VLALHJ 1 052 1 4.0+57 1 84 1 


10:52: 


: 14.04 


+57:18:42.0 


6 


0.96 


11.9 


107 


44± 11 


s 


-1 


OO + 0.42 
■■^^-0.44 




VLALHJ 1 052 1 4.2+573 1 40 


10:52: 


:14.18 


+57:31:40.9 


7 


0.97 


14.1 


38 


91± 17 


s 


> -0.62 


1 


VLALHJ 1 052 14.2+573328 


10:52: 


.14.21 


+57:33:28.2 


7 


0.96 


11.9 


35 


85± 17 


s 


-0 


+ U.41 
-0.37 




VLALHJ105214.6+571335 


10:52: 


.14.60 


+57:13:35.9 


5 


0.93 


7.9 


62 


24± 8 


SD 


-0. 


«„+0.42 , 
^^-0.54 ■ 


2 


VLALHJ 1 049 1 4.4+5702 1 


10:49: 


14.41 


+57:02:10.4 


5 


0.86 


14.2 


145 


23 1 ± 69 


S{X2) 


-0 


.„ + 0.36 
■■^'^-0.44 
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Class Description 



GMRT-VLA Crossmatch 



S Single Gaussian source without close neighbours 

SD Single Gaussian source with one close neighbour 

ST Single Gaussian source with two close neighbours 

SE Single Gaussian source with multiple close neighbours 

D Double Gaussian source without close neighbours 

DT Double Gaussian source with one close neighbour 

T Triple Gaussian source without close neighbours 

M Extended source composed of more than three Gaussians 



Table 2. Source classifications for the radio catalogues presented in Tables|3] 
and|4] By 'neighbours' we mean sources extracted by SAD with PNR > 4, 
using the criteria shown in Fig.|4] 



3.4 Source catalogues 

Various clean-up processes were applied to the initial catalogues 
produced by SAD (see First, we eliminated those sources 

lying closer than 30 pixels (24 and 37.5arcsec at 1.4 GHz and 
610 MHz) from the image border, where the noise is considerably 
higher, and removed sources forming part of multiple structures 
that have been considered as single emitters. The final catalogues 
comprise sources with PNR > 5 (uncorrected by bandwidth smear- 
ing) and integrated fluxes in excess of 3 x the local r.m.s. (this 
avoids a small number of sources with extremely small sizes). 

Lastly, we ran both source extractions again using 
\/2x Areabearrr convolvcd images to include extended emis- 
sion missed by the first selection process. We found 16 and 43 new 
sources in the GMRT and VLA mosaics, respectively. 

Final catalogues are presented in Tables [3] and |4] We have 
identified a total number of 1 ,587 and 1 ,452 sources with PNR ^5o" 
at 610MHz and 1.4GHz, respectively. 




-2 





RA offset (arcsec) 



Figure 5. Relative offsets between single sources found at 1.4 GHz with 
respect to the 610-MHz GMRT positions. Offsets are approximately nor- 
mally distributed. In R.A. and Dec. we find mean offsets of —0.60 it 0.03 
and 0.40 it 0.03 arcsec, respectively. Dashed lines show the mean offset 
values. 



4 NUMBER COUNTS 

We have derived number counts in the Lockman Hole using the 
catalogues shown in Tables[3]and|4] The differential number counts, 
dN/ dS, were calculated using the observed number of sources per 
bin of flux density, A'^, divided by the bin width (AS in Jy) and by 
the effective area (Acs in steradians) available for detection. 



3.5 Astrometric precision 

We plot in Fig. |5] the offsets of the VLA sources with re- 
spect to the GMRT positions, using only single sources ( ij3.3b . 
The offset distribution is approximately Gaussian with a mean 
of AR.A. = -0.60 ± 0.03 and ADec. = 0.40 ± 0.03 arcsec. 
These mean offsets, ~ 0. 5 arcsec, ar e observed at all flux levels, and 
their origin is unknown. iGarn et al] ([2007) found an incorrect time 
stamp in the GMRT data, resulting in a rotation of the positions 
near the edge of each pointing. This problem was corrected during 
the reduction of our data and is not responsible for the observed 
offsets, which may instead relate to VLA correlator i ssues which 
have only recently come to light jMorrison et"ai]|2009l . m prepara- 
tion), or be due to a different position for the phase calibrators used 
at the different frequencies. 

An external refer ence was used to te st the 610-MHz astrom- 
etry - recent work by iGarn et aL I ( l2008bl) . which includes a 610- 
MHz observation in the Lockman Hole. We find about 90 common 
sources (see i]7.1| for details) with a median offset of AR.A. = 
-0.54 ± 0.05 and ADec. = 0.00 ± 0.04. These offsets help ex- 
plain the R.A. offset found in Fig. [5] but not that in Dec. 

We have been unable to find a straightforward reason for the 
observed offsets between the VLA 1.4-GHz and GMRT 610-MHz 
sources. We have not implemented a positional shift in our cata- 
logues, although we highlight this issue in the table captions. 



4.1 Effective area 

In our catalogues, the selection criteria for radio emitters is de- 
termined by the local noise at the position of the source and by 
the effectiveness of SAD in detecting the sources. In particular, we 
note the noise across the image has a complicated structure and is 
correlated on several different scales. In order to find the effective 
area for source detection as a function of flux density, we mod- 
elled 25,000 point sources using the task IMMOD, with peaks from 
Ix to 500x the central r.m.s. in the mosaic (~ I5/iJybeam~^ at 
610 MHz and ~ 6 fijy beam~^ at 1.4 GHz). We performed the sim- 
ulations by introducing 500 mock sources into a residual image 
(without ^5ct positive or negative sources). We made use of two 
different (but complementary) residual images obtained from the 
negative (inverted) and the 'positive' (normal) maps. This allowed 
us to assess flux boosting due to random associations with faint, real 
sources. We then extracted a catalogue using SAD, with the same 
criteria as those described in fj3] Mock sources were introduced, 
taking into account bandwidth smearing, with random position 
angles, located >30 pixels from the image border, none of them 
overlapping. Bright residuals around powerful radio sources were 
avoided too - regions in the vicinity of the brightest 20 sources. 
The process of introducing sources was repeated 50 times. 



© 0000 RAS, MNRAS 000, 000-000 



The nature of the sub-mJy radio population 9 




Point Source Flux Limit (^iJy) 

Figure 6. Solid angle versus point-source flux limit. Solid and dashed lines 
are based on GMRT and VLA mosaics, covering a total of 3,534 and 
2,019 arcmin-^, respectively. 

Since mock sources were introduced randomly in the image, 
we can assume that the completeness function - the ratio of sources 
extracted to those injected - as a function of flux density, nor- 
malised to the area of the field, provides the effective area, AcS (5), 
which includes all possible biases from the SAD extraction and due 
to the noise structure of the map. 

Fig.[6]shows the solid angle versus point-source flux limit used 
in Eq. ([3} to estimate the differential number counts at 610 MHz 
and 1.4 GHz. 

4.2 Resolution bias 

In this work, most of the sources lie in the sub-mJy regime and the 
angular size distribution of the sub-mJy radio population is not well 
known. Given the low resolution of our GMRT and VLA maps, 
the vast majority of the sources are point-like (<4arcsec), there- 
fore based on our data we are unable to obtain a detailed and self- 
consistent angular size distribution. 

Source catalogues based on a PNR-threshold criterion are bi- 
ased against extended sources as a function of flux density. In order 
to estimate the fraction of sources being missed by our threshold 
criterion, we applied the following treatment. 

4.2.1 Area threshold 

Number counts are calculated as a function of integrated flux den- 
sity, while our catalogues are mainly based on a PNR threshold 
criterion. This restricts the detection of faint radio sources to those 
with small angular sizes. The resolution bias can be simply esti- 
mated using the following equation: 

F1UX[SAD] _ ^source _ / TT \ Bmaj[SAD] BmlnjSAD] 
Peak[SAD] ^bcam \41n2/ ^boam 

(4) 

where the [SAD] index indicate values from one single Gaussian 
from the SAD output fit; integrated 'Flux' is given in Jy, 'Peak' 
flux in Jybeam"^, Bmaj and Bmin are the fitted major and minor 
FWHMs in arcsec (not the deconvolved source size), and the synthe- 
sised beam area, Abeam = {j^) Bmaj'"=^™ Bmin'"'^™ arcsec^ 
Eq. l|4j implies that for our PNR threshold, the fitted area of a 
source as a function of flux density is restricted to: 



^source < ^bcam p^^^ 

5 X r.m.s. 

In particular, our PNR threshold implies the faintest sources 
are restricted to be detected with small fitted areas, ylsourcc ~ 
Abeam, or a little smaller than the beam given by the uncertainties 
in the fit. Note that Eq. ^ applies to sources fitted with a single 
Gaussian only - this is the case for the vast majority of the sources 
analysed in this work. 

4.2.2 Angular size distribution 

Previous studies have shown a decreasing angular size of the 
radio sources towards faint flux densities. An early study, 
I Windhorst et alj (1990) parameterised the angular size distribution 
of the radio emitters using the following equation: 

h{e) = exp[- ln(2) X (e/^med)"'''] (6) 

where h{9) is defined as the cumulative fraction of sources with 
angular sizes larger than 6 (the major FWHM) and ^mcd = 2 x 
S'i.Ighz arcsec is the median angular size as a function of flux den- 
sity (in mjy). This estimate p redicts small variat ions in ^mod as a 
functio n of radio flux de nsity. ISondi et"al] ( l2003i) . however, found 
that the Iwindhorst et al.l distribution yields a considerably higher 
number of sources with large angular sizes - by almost a factor of 
two (with 9 > 4 arcsec) than the observed in the sub-mJy regime. 
This translates into an over estimate of the sou rces expected to be 
missed in our observations. iBondi et alj ( l2003h found that the cu- 
mulative angular size distribution of sources with 0.4 ^ 5'l.4GHz < 
ImJy (expected to be unbiased for ^ 15-arcsec sources) is well 
described by: 

jd.er^ for .^4arcsec 

[e-^-^-O.Ol for 6l>4arcsec. 

High-resolution radio observations jMuxlow et al. I l2005l: 
iBiggs & Ivisoi] l2008l) . using data from both the Multi-Element 
Radio-Linked Interferometer Network (MERLIN) and the VLA, 
have given us an angular size distribution for the sources in the 
40/iJy < 5'i.4GHz S ImJy regime. Almost all the sources are 
resolved with angular sizes below 4 arcsec, which implies that our 
observations might be unaffected by resolution bias at fijy flux den- 
sities. Nevertheless. IMuxIow et all estimate their sample is 10 per 
cent incomplete based on previous observations with the Wester- 
bork Synthesis Radio Telescope (WSRT; with a synthesised beam- 
size of 15 arcsec, FWHM) in the same field, which we use as an 
upper limit for detections. 

We have adopted a cumulative angular size distribution given 
by Eq. ^ for bright (> 1 mJy) sources, and the average of Eqs ^ 
and ^ for fainter sources. 

4.2.3 Correction factors 

In order to relate Eq. ^ to the assumed cumulative angular size 
distribution, we have considered 6 = ^/ Asourec/l-1331/?7, where 
»7 is the median ratio between the major and minor FWHM (broad- 
ened by smearing effects) of the observed VLA sources, i] — 0.80. 
Although this is a strong assumption, changing this parameter does 
not significantly modify our results. 

In Fig.|7]we plot the expected missed fractions as a function of 
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Resolution Bias 




10^ 10^ 10 

Observed Flux ((LtJy) 



Figure 7. Estimated missed fraction of extended sources as a function of 
flux density. Solid and dashed lines are the estimates for tlie VLA and 
GMRT detections, respectively. Estimates are based on Eq. ^5) and an av- 
eraged cumulative size distribution based on Eq. ^6) and (7) - see il4.2.1l 
and i]4.2.2l We have assumed a spectral index of a = —0.7 for the GMRT 
predictions. 



flux density for both observing frequencies. Note, we have assumed 
a threshold of 1 per cent incomplete ness for our faintest radio flux 
levels, based on lMuxlow et al. Il l2005h . 

Since the angular size distribution of the 610-MHz sources is 
more uncertain than that of the 1.4-GHz sources, we have assumed 
a radio spectral index of a = —0.7 to calculate the missed fraction 
at 610 MHz. 

These correction factors are small due to the relatively poor 
resolution of our observations. Indeed, this bias is minimised when 
we include extended sources extracted from the convolved images 
(i ]3.4t . In particular, at 100 /iJy we predict that 3 per cent of GMRT 
sources and 6 per cent of VLA sources are not selected in our cata- 
logues. 

4.3 Differential number counts 

The differential number counts from Eq. ([3}, normalised for an Eu- 
clidean Universe, are plotted in Fig. [8] At both frequencies we have 
used the effective area shown in Fig.|6]and the correction for reso- 
lution bias plotted in Fig. |7] The flux density used to multiply the 
differential number counts is given by the bin centre (in log space), 
and errors are Poissonian for uncorrected counts, for both the ob- 
served and mock sources. Tables|5]and[6]present the counts. 

We observe a flattening in the Euclidean differential num- 
ber counts towards sub-mjy flux densities at both 610 MHz and 

1.4 GHz (Fig. [§}. We find evidence for a second peak in number 
counts at ~ 80 /iJy and ~ 200 ^Jy for the VLA and GMRT counts, 
respectively. These provide constraints on the contribution of IRAS- 
like sources to the sub-mJy radio fl uxes, based on population syn- 
thesis models faopkins et al.ll2000l) . The appearance of these fea- 
tures at sub-mJy radio fluxes is traditionally explained as a transi- 
tion from a dominant bright radio-loud AG N population to a star- 
forming and radio-quiet AGN populations dWindhorst etalJI 19851 : 
ISimpson et al J2006l : ICond"oi]l2007.) . 

In this work we extend the number counts down to very faint 
610-MHz flux densities whilst maintaining complete agreement 
with previous studies at higher flux levels. Our results at 1.4 GHz 
are in good agreement with prev ious observations at > 200 /jJy us- 
ing the VLA's B configuration jSondi et alj|2003l : ISevmour et al.l 
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Figure 8. Differential source counts as a function of flux density in the 
Lockman Hole at 6 10 MHz (top) and 1.4 GHz (bottom), normalised by the 
value expected in a static Euclidean Universe. Errors are assumed to be 
Poissonian ( Gehrels 1986) and are combined in quadrature for the observed 
number of sources in the bin and the mock source simulations described in 
^431 At 610 MHz, we plot data from previous studies: the VLA-VIRMOS 
Deep Field jBondi et al.l |2007|) . the l'' XMM-Newton/C handra Survey 
JMoss et al.ll2007l) . a compilation from lGam et al ] )2008bl) which includes 
data from the ELAIS Nl, Lockman Hole a nd Spitzer First-Look Survey 
fields and counts obtained using the WSRT jValentiin et aljl 19771 : iKatgerll 
[l97i;[Valentiin 1980; Katgert-Merk eliin et alJI 19851) . We also show previ- 
ous 1.4-GHz counts, based on studies using the VLA and u sing the Aus- 
tralia Telescope Compact Array: Th e Deep Swire Field jOwen & Momsoiil 
2008). the Phoenix Deep Survey I Hopki ns et Zl l2003h , VLA-VIRMOS 
Deep Fiel d iBondi et a l 2003 ), the llt'^XMM-Newton/ROSAT T)cs-p X-ray 
Survey |Sevmour et al]|2004l) and the SahaiulXMM-Newton Deep Field 
iSimpson et alj2006l) . 



I2OO4!) and the Australia Telescope Compact Array jHopkins et al.l 
2003). At fainter levels, < 100 /xJy, our 1.4-GHz counts are a lit- 
tle higher than the majority of previous estimates. This may re- 
flect underes timates of number counts based on shallower images; 
for example, lOwen & Morrisoiil 1 I2OO8I) reported an approximately 
flat logA''-logS' distribution down to Si.4ghz ~ 15/iIy, exploit- 
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Table 5. The 610-MHz radio source counts. 
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Table 6. The 1 .4-GHz radio source counts. 
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11 
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ing an extremely deep VLA 1.4-GHz image of the 1046+59 field 
(r.m.s. ^ 3 /iJy). The decrement of the number of radio sources to- 
wards faint flux densities (Fig. [8j is highly dependent on the ef- 
fective area, Acs, in which detections are possible (Fig. [6l(. We 
find Acfi always decreases slower in relation to the number of 
sources in each bin, so underestimating our number counts would 
require us to have undercorrect ed for resolution bias. We note, 
however, that even adopting the I Windhorst et"ZI i ll 9901) size dis- 
tribution could not increase the number counts to those found by 
lOwen & Morrisonl . 

A large number of faint radio sources was suggested recently 
by the Absolute Radiometer f or Cosmology, As trophysics and Dif- 
fuse Emission (ARCADE 2 - iFixsen et al.l2009b experiment where 
an excess brightness temperature was found in the 22 MHz-10 GHz 
range, where the sky is expected to be dominated by synchrotron 
and free-free emi s sion f rom extra-galactic sources and the Milky 
Way. iFixsen et alj ( |2009|) reported seeing ap proximately 5 x the ex- 
pected contribution from faint radio sources dOervasi et alj2008) to 
the cosmic microwave background, which sets a useful limit for the 
total number of radio sources. 

The origin of the wide scatter in reported 1.4-GHz number 
counts is controversial. It is possible that for < 500 /iJy sources 
our imaged area is not big enough to average out cosmic struc- 
ture. For a field subtending an angle of one square degree, the 
angular d iameter d istance at redshift unity is only ~20Mpc. We 
note that ICondonI {2003) estimated a count fluctuation of only 
(J = (1.07 ± 0.26) N^^^ based on 17 non-overlappin g fields in the 
Spitzer First-Look Survey (FLS - ICondon et al.l2003h . where 
is th e statistical fluc tuation expected without clustering. Based on 
this. lCondonI ( |2007|) stated that most of the variance reported in the 
literature is 'mundane, not cosmic', thereby concluding that years 
of debate have been devoted to differences induced by different 
instruments and analysis techniqu es (see i ]7.2t and possibly - in 
more candid terms - human error. iBiggs & Ivison (2006) came to 
a similar conclusion, finding around double the sou rce coun t in the 
Hubble Deep Field North as had been measured bv iRichardj ( |2000|) 
and tracking the problem to a simple arithmetical error rather than 
any fundamental problem with the data or their reduction. 

Another possible origin for some of the reported scatter may 
be the use of different VLA configurations for deep survey work, 
either through problems setting the absolute flux scale, or via inade- 
quate correction for bandwidth smearing. We have separated the A- 
and B -configuration data centred on our LOCKMAN-E VLA point- 
ing to test if this introduces notable differences. We find a value 
of Sa/Sb ~ 1.03 ± 0.21 for the mean and standard deviation of 
the flux density ratios observed on sources detected in both con- 
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figurations with peak-to-noise ratios greater than 10 (42 sources). 
We conclude that use of different configurations does not signifi- 
cantly bias estimates of flux densities in our catalogues. The scatter 
must originate elsewhere, assuming that data have been calibrated 
carefully, using appropriate uv restrictions or calibrator models. 

Noise can contaminate the number counts through the inclu- 
sion of spurious sources and by boosting intrinsically faint sources 
to higher flux densities. We have tested the possibility that spuri- 
ous sources contaminate the samples by implementing an identical 
source extraction procedure for the inverted (negative) signal map 
and we are confident that any contamination by spurious sources is 
below 5 per cent, even in the faintest flux density bins. 



5 SPECTRAL INDICES 

We have cross-matched (within 7 arcsec) the GMRT and VLA radio 
catalogues in order to estimate the spectral index of the radio emit- 
ters. This measurement yields evidence for the synchrotron mech- 
anism which dominates the observed radiation from the sub-mJy 
radio population. 

In Fig. |9]we show the spectral index between 610 MHz and 
1.4 GHz as a function of flux density. We now analyse the spectral 
indices based independently on GMRT- and VLA-selected sam- 
ples, where a 610-MHz-selected catalogue naturally tends to prefer 
the detection of steep-spectrum sources while selection at 1 .4 GHz 
favours flatter spectra. 

Since the VLA and GMRT images have different resolutions, 
special care must be taken when we analyse results based on differ- 
ent frequency-selected samples. The 610-MHz catalogue, from a 
lower-resolution image, tends to have more counterparts per source 
than the 1.4-GHz catalogue. This issue confuses the statistical stud- 
ies of spectral indexes since more unrelated galaxies are summed 
up or split depending on the sample criterion. For example, given 
our selection criteria and the resolution difference between the ob- 
serving frequencies, it is possible that a single source at 610MHz is 
related to two single detections at 1 .4 GHz. In this case, the spectral 
index of the GMRT source is calculated using the sum of the flux 
densities from both VLA sources. For the VLA-selected sources 
we have divided the flux density at 610 MHz based on the relative 
contribution from each VLA source (spectral indexes based on split 
sources are flagged as such). Where there is no clear close counter- 
part (within 7 arcsec), 5-ct upper/lower limits are calculated using 
the local r.m.s. ( ii3.U . weighting by the source to beam area ratio. 

Fig.|9]shows the spectral indices for GMRT-selected (top) and 
VLA-selected (bottom) samples. Both distributions show a large 
scatter, casting doubt on previous studies which assume a clean 
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Figure 9. Spectral index, ctf^lo^^ = —2.77 X log(S'6ioMHz/'S'i.4GHz). as a function of flux density, based on a 610-MHz-selected sample {top) and a 
1 .4-GHz-selected sample {bottom). Dashed and solid lines correspond to the primary and secondary point-source flux limit for each survey. Plus symbols 
represent detections. Arrows represent upper and lower limits for GMRT and VLA sources, respectively. Diamonds represent unreliable spectral indexes - 
sources that were split, missing fainter components, or large offsets in the cross-match. The data at a = —3 are sources outside the overlapping region. 



star-forming galaxy population with a single spectral index in the 
sub-mjy radio regime. 

Upper/lower limits dominate at the faintest fluxes, partly due 
to the difficulty in obtaining counterparts so close to the detec- 
tion threshold, where the catalogues are incomplete, and partly due 
to the tendency to detect steeper-spectrum sources at 610MHz or 
flatter-spectrum sources at 1.4 GHz. In our work, this bias does 
not allow the study of spectral index for sources with Siaght. S 
100 fiiy. The larger number of lower limits at 1.4 GHz reflects the 
extra depth of the VLA imaging. 



In Table|7]we show the observed statistical results for spectral 
indices as a function of flux density (parenthesis show mock values 
- see below). In order to avoid a large fraction of upper limits in 
these statistical calculations, we have used only ^lO-a (PNR) de- 
tections for each of the catalogue-based samples, but down to 5-a 
for the counterpart. We find no trend in the distribution of spec- 
tral indexes toward fainter flux densities in either the GMRT- or 
the VLA-selected catalogues. Since these two samples tend to se- 
lect spectra with different spectral indexes, numerical differences of 
~ 0.2 in the mean a, and ~ 0. 1 in the median, are seen in the esti- 
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Figure 10. The median- bootstrap (in black) and the Kaplan-Meier 
^Feigelso n & Nelsorj[T983) mean (in red) spectral indices as a function of 
flux density. Data are based on 610-MHz- and 1.4-GHz-selected samples 
and shown in Table[2] In the image, mean values are slightly shifted in flux 
density (just for clarity). 



mates for similar flux density bins. These results contradict the sug- 
gested flattening in spectr al index at sub-mJy radio flux densit ies 
quoted in previous studies teondi et al.ll2007l ; lGarn et alj2008ah . 

The almost constant mean and median values of a in the 
sub-mJy regime (certainly between 0.1 — lOmJy) is a robust 
result, and suggests the sub-mJy radio population is dominated 
by optically-thin synchrotron emission from star-forming galax- 
ies and/or from steep-spectrum lobe-dominated FR AGN. We show 
the trend for the mean and median spectral index as a function of 
flux density in Fig. [TO] The mean (in red) includes upper/lower 
limits using the Kaplan-M eier product-limit estimator (ASURV - 
iFeigelson & Nelsonlll985h . and the median (in black) is obtained 
from detections using a bootstrapping approach. The small varia- 
tions in these estimates reflect the non-Gaussianity and the large 
scatter of the distribution, especially for the faintest detections. In 
particular, the very steep spectral index obtained from the faintest 
GMRT flux bin contains ~20 per cent of upper limits, a result 
which may be slightly biased toward steeper spectral indexes (see 
simulations in parentheses from Table. |7}. 

The scatter of the af-icSi distribution is CTq ~ 0.4 in 
the sub-mJy regime, which suggests the detection of a large va- 
riety of populations - probably a s ubstantial number of syn- 
chrotron self-absorbed AGN cores ( iBlundell & Kuncid l2007l : 
[Snellen et alj|2000.) and h igh-redshift ultra-steep-spectrum (USS) 
sources jjarvis et alj|200lh . 

Taking into account the point-source simulations from i]4.1l 
we have estimated the distribution of spectral indices based on vari- 
ations between injected/extracted flux densities. Assuming a radio 
population with a = —0.7, and taking into account the uncertain- 
ties in the source extraction process, we predict that the spectral 
index distribution should broaden towards faint fluxes, reaching a 
scatter similar to that observed in our faintest flux density bins (see 
Table I7J. These simulations are presented in parenthesis next to 
the observed results, and imply that the broad distribution of spec- 
tral indices is intrinsic above S'i.4ghz ^ 100 /iJy but dominated 
(broadened) by the fitting uncertainties (trumpet-like) at fainter flux 
densities. 

Based on all sources, 6 per cent (<13 per cent) and 6 per 
cent (<9 per cent) have q > in the GMRT- and VLA-selected 



catalogues, respectively (the fractions in parenthesis include up- 
per/lower limits). Inspection by eye of these flat-spectrum sources 
reveals them to be compact at both wavelengths, brighter in the 
VLA image than in the GMRT image, probably due to syn- 
chrotron se lf-absorption in compact (^1 kpc) GHz-peaked sources 
jSnellen eral...2000. ) which are believed to be young FR II sources. 



6 X-RAY IDENTIFICATIONS 

Deep, hard X-ray observations (in the 2-10 keV band) p rovide an 
efficient method for identifying AGN jMushotzkvll2004l) , at least 
with column densities TVh < 10^* cm~^ (i.e. those that are 'Comp 



ton th in'). Heavily absorbed AGN are common iMaioIino et al.l 
Il998l) and are expected to be re sponsible for the bulk of the cos- 
mic X-ray background (CXRB: [Ueda et alj|2003l ; iHasingerlbOoi : 
lGiUietal.l f2007). Indeed, a large (~20-50 per cent) fraction is 
believed to be Compton thick (TVh > 10^* cm~^), a population 
missed by even the deepest X-ray observations. Given that radio 
observations are unaffected by obscuration, this suggests that deep 
radio surveys may provide a method to find this missing population 

- the so-called QSO-2s. 

In order to probe the nature of the sub-mJy radio population 
we have cross-matched our VLA 1 .4-GHz catalogue (Tabled with 
the d eepest XMM/Newton image so far published jBrunner et alj 
l2008l) . The X-ray field has an area of 0. 1 96 deg^ (which is entirely 
covered by our VLA mosaic) and an effective exposure time of 
637 ks. The X-ray catalogue contains 409 sources above a likeli- 
hood of 10 (3.9(7) of which 266 and 340 are detected in the hard 
and soft X-ray band (^-lokov 5S 9 x 10~^® erg sec~^ cm~^, 
S'o.5-2kcV J? 1.9 X 10~^^ erg sec~^ cm~^), respectively. The con- 
tamination by spurious sources in the X-ray catalogue is expected 
to be only ~1 per cent. 

In Fig.ll IKZeff) we show the fraction of radio sources detected 
in the hard X-ray band - a good indicator for AGN activity - as a 
function of radio flux density. 32 per cent of the hard X-ray sam- 
ple (85 sources) have a > 5(t (PNR) I.4-GHz detection, within 
Sarcsec. This fraction declines from ~30 per cent at ~ImJy to 
~I0 per cent at < 100 /iJy. A large number of faint radio sources, 
S'i.4GHz < 300 /.iJy, that are det ected in the ha r d X-r ay band. 
This has been previously noted by 'Simps on et alj ( l2006h . though 
in terms of the relative /racfto« they remain a minority. 

The deeper radio and X -ray catalogues used in this work 

- compared to those used by ISimpson et alj (where < 5'i.4ghz ^ 
100 M Jy and S'2-iokeV 3 x 10"^^ ergsec"^ cm"^ - lUeda et all 
|2008^ - do not show any clear evidence for a significant in- 
crease in the fraction of sources detected in X-rays at faint radio 
fluxes. Indeed, when we bin the X-ray data we find that between 
S'2-iokoV = lO"^'^ - 10"" ergsec"^ cm"^, the fraction of X- 
ray sources detected in the radio image is relatively constant, ~35 
per cent. For the faintest X-ray sources, only 25 per cent are de- 
tected in the radio image. We note that the use of a deeper, unpub- 
lished XMM/Newton catalogue (Mat Page, private communication) 
does not significantly modify the statistics at the faintest flux den- 
sities but increases (to 50 per cent) the fraction of counterparts at 
Si.dCHz ^ 1 miy. 

The Lockman Hole is a popular legacy field and a large num- 
ber of the X-ray sourc es have been sp ectroscopically classified (e.g. 
iLehmaim et alJ uOOl ; Szokolv et al. 2005, in preparation). We use 
the compilation of Brunner et al. (2008) to plot the hard X-ray and 
1 .4-GHz flux densities (Fig.[TT] right). Overplotted in the figure, we 
show the expected correlations between radio and X-ray fluxes for 
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Based on the 1.4GHz-VLA catalogue 

1.4 GHz Flux Bin (mJy) (^^^'^GH." ) AS "rV ((<^> ± )„ormal {(<^>±'^c»)biwoigl,t <("!'i'Gi?/)>boot=trap 

3.00 - 20.00 -0.59 ± 0.06 -0.59 ± 0.26 (-0.68 ± 0.02) -0.67 ± 0.18 (-0.68 ± 0.02) -0.63 ± 0.05 (-0.68 ± 0.00) 

0.90 - 3.00 -0.48 ±0.11 -0.48 ± 0.59 (-0.70 ± 0.04) -0.64 ± 0.50 (-0.70 ± 0.04) -0.60 ± 0.09 (-0.70 ± 0.00) 

0.30 - 0.90 -0.51 ±0.07 -0.61 ± 0.48 (-0.68 ± 0.18) -0.67 ± 0.29 (-0.69 ± 0.17) -0.65 ± 0.03 (-0.69 ± 0.00) 

0.10 - 0.30 -0.48 ± 0.03 -0.57 ± 0.43 (-0.56 ± 0.55) -0.59 ± 0.41 (-0.64 ± 0.45) -0.60 ± 0.02 (-0.65 ± 0.00) 



Table 7. Statistical properties of radio spectral indices as a function of flux density. These estimates are based on confident measures of the spectral indices, 
i.e. with no flags in Table |3] and |4] For both the 610-MHz- and 1 .4-GHz-selected samples we use only ^IO-ct (PNR) detections (but down to 5-cf for the 
counterpart) to minimise the uncertainties from upper/lower limits. Columns: (1) the flux density bin; (2) the mean value given by the Kaplan-Meier product- 
limit estimator (ASURV - Feigelson & Nelson 1985) which takes into account the upper hmits in the bin; (3) the mean and standard deviation of the spectral 
index distribution based on detections o nly; (4) the c entral location (mean) and scale parameter (sigma) based on the biweight estimator which is resistant to 
outliers and non-Gaussian distributions iBeers et"al]IT990i) ; (5) the median value of the distribution based on a bootstrap approach. The parentheses shown in 
some of the columns are the statistical estimates based on a single a = —0.7 population with input/extracted flux ratios from the mock sources described in 
i]4. II These constitute a useful check on the reliability of the results, e.g. whether the observed scatter is intrinsic or dominated by errors. 
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Figure 11. Left: The fraction of radio sources (at 1.4 GHz) detected by XMM/Newton in the hard X-ray band. Errors are assumed to be Poissonian. Right: 
Radio flux density vers us X-ray flux density for cross-matched sources. The solid and dashed lines conespon d to the expected coiTelation for radio-quiet 
AGN terinkmann et al . 2000) and star-forming galaxi es I Condon 1992; Ranalli et al. 2003) introduced by Simp son et alj )200d) , res pectively. The differe nt 
populations are based on spectroscopic identifications iLehmann et alj e.g. 1200 lUSzokolv et alj e.g. 120091 in preparation) compiled bv lBrunner et alj l2008l) . 



starbursts (ICondonlll992l : lRanalli et"ar]|2003l) and radio-quiet AGN 
jBrinkmann et al.l200Cll) as dashed and solid lines, respectively. De- 
viations from these correlations can be produced by photoelectric 
absorption (lower X-ray fluxes) or by jets oriented close to our line 
of sight (larger radio fluxes). 

We find that in the sub-mjy radio regime, the vast majority of 
the radio-quiet AGN (types 1 and 2) have 52-iokeV ^ 3 x 10"^^ 
erg sec~^ cm^^. This tallies with ISimpson et all 's criterion for 
identifying radio-quiet AGN, based on those sources lying closer 
to the solid than the dashed line in Fig. [TT] (ng/z/)- We find that 
the contamination produced by galaxies at fainter X-ray flux den- 
sities could be very large, since most of the spectroscopically iden- 



tified galaxies lie at .Sg- iokeV 4 x 1 0~^° erg sec~^ cm~^ 
- also in agreement with ISimpson et al] 's criterion. Due to pho- 
toelectric absorption, the distribution of type-2 AGN has a large 
scatter in the hard X-ray band - they are typically found with ra- 
dio fluxes, S'i.4GHz ^ 100 /iJy. It is interesting to note the ra- 
dio emission, at the ~100-/iJy level, in two X-ray clusters (XM- 
MUJ105339.7-I-573520, XMMUJ105346.4-I-573510) and one clas- 
sified group (XMMUJ105318.9-I-572044). 

From Fig. [TTI {right) we find that the solid line 
terinkmann et al.l 's correlation) appears to define an upper 
limit for the AGN population. The well-known radio-loud AGN 
population with > 1 mJy tends to agree better with the star-forming 



® 0000 RAS, MNRAS 000, 000-000 



The nature of the sub-mJy radio population 15 



1 .0 

J 0.8 



1 1 1 1 

X-Ray Detected 


All radio 










J" 

, , , 1 , 





-3-2-10 1 2 

610MHz 
^ 1 .4GHz 

Figure 12. The radio spectral index between 610 MHz and 1 .4 GHz for the 
sample of radio sources detected in the hard X-ray band. 

galaxy correlation {dashed line). Finally, as the completeness func- 
tions indicate - top and right axes - the faintest radio/X-ray 
sources are observed in different areas of the map. 

We have estimated the total fraction of radio-quiet AGN in 
the sub-mJy radio regime based on four assumptions: (1) the X- 
ray catalogue contains almost all the type-1 AGN in the redshift 
range of the radio sources (mean, z ~ 0.8); (2) to estimate an up- 
per limit the spectroscopically-identified sources maintain the same 
relative fractions in the unknown population as in Fig.lllKn'g/;?'): 

(3) a constant fraction of type l/type2 AGN =1:4 based on X-ray 
observations of local Seyfert galaxies (e.g. iMaiolino et al.lll998h : 

(4) 25 per cent of the X-ray sources are Compton-thick (undetected, 
obviously) based on synthesi s population models to reproduce the 
CXRB (e.g.'Ueda et al]|2003l) . 

We find that the number of classified type 1 AGN detected 
in both 1.4-GHz and X-ray wavebands is 21 (16 in the hard; 
8 in the soft; 3 in both). These sources comprise 38 and 31 
per cent of the spectroscopically classified samples (with ra- 
dio detections) in the hard and soft X-ray bands, respectively. 
Considering the assumptions described above, we expect ^131 
(21 X [1 + 4] X [1 + 0.25]) AGN in our radio sample, i.e. ~20 
per cent of the 755 radio sources in the region covered in the X- 
ray waveband. Fig. [TT] (right) clearly shows that most of the X- 
ray detections are at S'i.4ghz Si 300 /iJy, therefore applying the 
same treatment we estimate that radio-quiet AGN compose a 19- 
37 and 19-30 per cent fraction of the S'i.4ghz < 100 /^Jy and 
100 /^Jy ^ 5'i.4GHz < 300 /iJy radio population, respectively 
(upper limits based on the second assumption above). No strong 
variations for the content of radio-quiet AGN as a function of radio 
flux density are seen. These rough estimations are a little higher 
than previous lsimpson et al.l 's estimate, i.e. 10-20 per cent. 

These results suggest a transition, at sub-mJy radio flux levels, 
from a bright and powerful AGN to a dominant star-forming galaxy 
population, contaminated at the ~ 25 ± 10 per cent level by radio- 
quiet AGN. 

Of the 84 hard X-ray sources detected at 1.4 GHz, 48 have 
a reliable radio spectral index (PNR > 5 in both 610 MHz and 
1.4 GHz images). In Fig.[T2]we plot al^^aS^ 

for the X-ray sources 

alongside those of the entire VLA sample. For the radio/X-ray 
sources, we find a median spectral index of —0.63 ± 0.04, slightly 



Class 


N 


(("1.4GHz >> 


Type 1 AGN 


9 


-0.79 ± 0.20 


Type 2 AGN 


8 


-0.60 ± 0.12 


Galaxy 


6 


-0.65 ± 0.26 


Unknown 


24 


-0.58 ± 0.07 


Cluster/Group 


1 


-1.87 



Table 8. The median (bootstrap) radio spectral index for the spectroscopi- 
cally identified hard X-ray sources plotted in Fig. llll (rigfa). 



flatter than that of the whole 1.4-GHz sample (-0.70 ± 0.01). A 
Kolmogorov-Smirnov test gives a probability of 16 per cent that 
both populations come from the same parent distribution. The spec- 
tral indices for the spectroscopically identified X-ray populations 
are presented in Table [S] Given that most of the sources are found 
at faint radio fluxes, these values may be highly biased by incom- 
pleteness at 610 MHz. 

We conclude that radio-quiet AGN are no more numerous 
than star-forming galaxies at faint flux densities. The fraction of ra- 
dio sources harbouring an AGN decreases towards faint radio flux 
densities - a transition from radio-loud AGN to a dominant star- 
forming galaxy population at sub-mJy radio fluxes. 



7 DISCUSSION 

7.1 Comparison with a previous 610-MHz survey 

iGam et al. I ( l2008bl) published a 6-ct 610-MHz catalogue covering 
5deg'^ in the Lockman Hole using GMRT. This allows a d irect 
comparison with our detections at 610 MHz. The iGarn et alj mo- 
saic (aeioMHz ~ 60 /iJy beam~^) covers the north-western portion 
of our GMRT mosaic, with ~ 0.6 deg^ in common. Cross-matching 
their catalogue with ours reveals good agreement in flux densities 
for the brightest detections. However, we find a very large num- 
ber of spurious, fai nt sources in their catalogue. In Fig.[T3](top) we 
show the fraction of lGam et al J sources which are recovered (within 
7 arcsec) in our 4x deeper image, as a function of thei r radio flux 
densities. Given the depth of our image, we expect all iGam et al.l 
sources should have a counterpart in our catalogue, but no hints 
of emission in the image (Fig.[T3]- bottom) are found for the vast 
majority of their % 3mJy (PNRs 10) sources. We conclude that 
iGam et al.l b-Qm^ ) have not provided a reliable 6-cr catalogue. It 
is interesting to note, however, that the number counts presented in 
Fig.[8]are in agreement with ours, probably because they base their 
number counts considering bright sources (> IO-ct) in the cleanest 

regions of their map only. 

Looking at the full iGam et al] Lockman Hole image, their 
sources lie mainly near the edge of their pointings, where the 
noise levels are greatest. The cleanest areas of their image con- 
tain very few sources, suggesting that their source extraction did 
not utilise knowledge of the local nois e level. The distri bution of 
61 0-MHz sources in the Spi tzer FLS jOam et alj|2007l , as used 
by iMagliocchetti et al] 1200 8l) appears similar, with a dearth of 
sources in the deepest p ortion of the mosaic (see figure 1 of 
Magliocchetti et_alj 120081) . This sug gests that some of the faint 
steep-spectrum emitters seen by Ma gliocchetti et akl may be spu- 
rious (by random association), and that their spectral index distri- 
bution is likely broadened at faint flux densities. 
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Figure 13. Top.- Fraction of iGam et all J2008lj) sources recovered in our 
image as a function of their 610-MHz flux density. Bottom: An area of 12 X 
12arcmin^ from our 610-MHz mosaic shown in grey-scale with a linear 
stretch from 0-200 /jjy beam" ^ . Crosses show source positions catalogued 
by Gam et aU t2008b.^ . We expect detections of all their sources since they 
have 5C|-°Hz > 300 /.Jy. 



7.2 Comparison with a previous 1.4-GHz catalogue 

iBiggs & Ivisoi] iim& i used some of the VLA data utilised in this 
paper. They present a high-resolution (A-configuration) map with 
a ~ 1.3-arcsec beam (FWHM), and a different method of source 
detection. We find our work yields slightly higher flux densities. 
The mean flux density ratio, ST/GHz/^^^iiGHz' is 1-16 ± 0.02 
(88 sources) with a standard deviation of 0.15 for cross-matched 
sources with peak-to-noise ratio higher than 15. We have demon- 
strated in ^4. 3 1 that this difference is not produced by calibration 
problems from adding A- and B-configuration data. A detailed 
analysis of both samples has been carried out and we have found 
that differences in flux are pro duced by the different a pproaches to 
source extraction. In particular. lBiggs & Ivisorj | |200Q) used a fixed 
beam size to fit a Gaussian to sources which were first extracted 
with areas smaller than the beam. This results in lower measured 
flux densities, as seen in Fig. [14] Reducing all our 1.4-GHz fluxes 
by a factor 1 . 1 6 x would decrease the observed spectral indices (nu- 
merically) by 0.18 (i.e. the spectra become steeper), but we stress 




10' . 



10' 



10^ 10^ 10 

Flux density (/LtJy) from this work 



Figure 14. Integrated flux densities for the sources detected in the 
Lockman Hole at 1. 4 GHz in comparison with a previous work by 
iBiggs & Ivisoiil <200d) . Sources have been cross-matched using a search 
radius of 4.2 arcsec. 



that this would not affect the (absence of) trend in the spectral in- 
dex distribution towards faint flux densities, nor the width of the 
observed a distribution. 



7.3 Previous spectral index studies 

The radio spectral index observed between 610 MHz and 1.4 GHz 
has been controversial sin ce the earliest studies. Using the WSRT, 
iKatgert & Spinradl ( fT974l) found - from a small sample of sources 
with SeioMHz ^ 10 mjy - a spectral index distribution, af^icH^^ = 
—0.52 ± 0.39, an unusual result with a broad distribution with re- 
spect to higher frequency surveys. A muc h larger samp le gave a 
similar result: af:2^^i = -0.68 ± 0.31 l lKatgertlll979l) , statisti- 
cally in agreement with the previous work, but showing clear evi- 
dence for more complex than single power law spectra. Our survey 
covers only ~ 1 deg^ and therefore contains a small number (18) 
of sources in the SeioMHz ^ lOmJy range. From these sources, 
12 have got a reliable radio spectral index from which we find 
a mean (q) ~ —0.71 and standard deviation of Gc, ~ 0.17, in 
agreement with early results but with a considerably tighter dis- 
tribution. Since th ese sources are most ly powerful steep-spectrum 
radio-loud AGN jHopkins et al 1 12000) . we have also compared 
these spectral indexes with a sample of 2 < 0.5 FR II sources 



(<^750Mlfz = 0.79 ± 0.14, mean and scatter), finding good agree- 
ments as well jLaing e t al. 1983). 

With the advent of the GMRT, a variety of spectral index re- 
sults have appeared in the literature. They cover different flux den- 
sity ranges, so we present in Table|9]a comparison using the same 
ranges. W e find good agreement - within the errors - with the es- 
tima tes of Bondi et aU ( 2007) and lTasse et al.l ( l2007h . although not 
with lGam et al.l ( [2008ah . probably because of faint spurious associ- 
ations (discussed in or incompleteness in their estimations. 
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Flux riuige (nijy) 


\\"l.4GHz // 


//^610MHz\\Ours 
\\"l.4GHz // 


0.5 < Sl.4GHz 
U.iO < oi.4GHz < U.o 
0.1 < Si.4GHz < 0.15 


-0.67 ± 0.05(1) 

n A(\ -U n nQ(l) 
— U.4D it U.Uo^ ^ 

-0.61 ± 0.04(1) 


-0.63 ±0.03 

n fiQ -U n no 
— U.Do it U.Uz 

-0.59 ±0.03 


1.0 < S'eioMHz 
0.5 < SeioMHz < 1.0 


-0.56 ±0.04(2) 
-0.36 ±0.12(2) 


-0.69 ±0.04 
-0.71 ± 0.04 


30 < SeioMHz 


-0.76(3) 


-0.71 ±0.17 



Table 9. A direct comparison of our results wit h previous w ork on tlie 
spectral index betw een 610 MHz a i id 1.4 GHz: f n iBondi et al] iioO?): (2) 
iGametaljfeoOSd) : O'j iTasse et all fcOOVl) . The double pai'entheses enclose 
median values, where we estimate our errors using a bootstrapping analysis. 



7.4 The nature of the sub-mJy radio emitters 

There is little agreement in the literature concerning the optical 
properties of sub-mJy radio sources. Some studies support the 
idea that faint radio emitters are primarily star-forming galaxies. 
Th e spectroscop i c clas sification of Si.4ghz ^ 40-/iJy sources 
by iBarger et alj ( l2007b shows that the dominant population has 
stron g B aimer lines and no broad or high-ionisation lines. Like- 
wise, iBondi et al. I( l2007b find that late-type starbursts dominate the 
S'i.4GHz ^ 1 00 /xJy regime based on analysis of an optical colour- 
colour plot jCiliegi et al. 2Q05|) . Work based on a morphologi- 
cal classification in the optical jPadovani et alJllOOTh shows that 
star-forming galaxies comprise only about a third of the sub-mJy 
popul ation. More rad i cally, based on another optical colour-colour 
study, ISimpson et J] ( 12006 ') presented evidence for no change in 
the composition of the radio source population towards faint flux 
densities, arguing for a dominant passively-evolving massive ellip- 
tical galaxy population at all flux levels, 5'i.4ghz ^ 100 /iJy. On 
the other hand, at ~ 100 /iJy, bright submillimetre-selected galaxies 
- which are very clearly dominated by star formation - make up a 
significant number of t he optically faint radio emitters jivison et al.l 
l2002l:|Pope etai]|2006h . 

The approaches taken by all these studies reflect the difficulty 
of disentangling star-forming galaxies from nuclear AGN activity. 

We note that the large scatter {oa ~ 0.4) seen in Fig. |9] for 
the spectral indices suggests a more complicated scenario than a 
simple star-forming galaxy population (e.g. see Fig.[TT] right). We 
have found that the use of radio spectral index as a probe of a 
galaxy's nature is highly degenerat e: supernova remnant s and nu- 
clear activity are closely related ( Gebhardt et al J 12000): redshift 
effects may be combined with synchrotron losses, steepening the 
spectra jjarvis et"ai]|200ll) and - given the poor resolution of our 
images - spectral indices cannot be obtained for resolved compo- 
nents. Therefore, a galaxy's nature is difficult to disentangle using 
only the radio spectral index. Nevertheless, in this work we find that 
the sub-mJy radio population is characterised by optically-thin syn- 
chrotron emission, contaminated at the ~ 25 ± 10 per cent level by 
radio-quiet AGN - based on X-ray detections and previous spectro- 
sc opic classifications - in rough agreement with a previous study 
bv lSimpsonetal.l ( l2006l) . 



8 CONCLUDING REMARKS 

We have observed the Lockman Hole field using the GMRT at 
610 MHz and the VLA at 1.4 GHz, obtaining two deep radio im- 



ages with similar spatial resolutions and well-matched noise levels 
(15 and 6 /ijy beam~i, respectively) - the former representing the 
deepest GMRT image yet published. The data reveal a flattening 
followed by a second peak in the Euclidean-normalised number 
counts in the sub-mJy radio regime - evidence for the appearance 
of a different radio population dominating these faint flux densities. 

We discuss the reliabili ty of the 610-MHz catalogue pre- 
sented by iGam et al] ( l2008bh in the Lockman Hole, finding that 
their catalogue is highly contaminated by spurious sources, with 
similar problems apparent in their previous Spitzer FLS catalogue 
(iGam et al..2007) . This may have in fluenced the detection of st eep- 
spectrum radio emitters reported bv lMagliocchetti et alj ( 1200 8l) . 

We study the spectral index of the radio emitters by combin- 
ing our GMRT and the VLA measurements. The GMRT data are 
about 4x times deeper than previous imaging, allowing for clean 
results and avoiding the well-known bias for steeper- or flatter- 
spectrum sources in samples selected at longer or shorter wave- 
lengths, respectively. Analyses show evidence for large capabili- 
ties from GMRT observations for imaging steep-spectrum steep- 
spectrum sources in the field. Indeed, approximately 30 per cent 
of the GMRT sources are undetected at 1.4 GHz, revealing the 
sensitivity of deep 610-MHz images to faint, high-redshift star- 
forming galaxies such as those detected in submillimetre surveys 
(e.g. Ivison et al. 2007). 

Our results, based on ^lO-cr selection criterion, show that 
the mean and median spectral index does not evolve as a func- 
tion of radio flux density, certainly between ~100 /iJy and lOmJy 
at 1.4GHz. We find Oi^'ghz^ ~ -0.6 to -0.7, which suggests 
that optically-thin synchrotron emission is the dominant emis- 
sion mechanism in the sub-mJy population. Th e two most prob- 
able contributors are star-forming galaxies and iFanaroff & RilevI 
sources, ruling out a possible dominant flat-sp ectrum population 
(AGN-cores; GPS ) at these faint flux densities dBondi et aLll2007l: 
iGarn et al.ll2008ah . 

We find the distribution of spectral indices has a signifi- 
cant scatter (ctq ~ 0.4), which suggests a complicated scenario 
where different populations mix together. The fraction of inverted- 
spectrum (a > 0) sources is just 6 per cent (<11 including lower 
limits) of the total sample. 

Based on X-ray obs ervations with spectroscopic classifica- 
tions dBrunner et al.l2008l) . we estimate that approximately ~ 25± 
10 per cent of the radio sample is made up of radio-quiet AGN at 
30/iJy < 5'i.4ghz < 300 /iJy, with this fraction rising towards 
brighter flux densities. These results suggest a transition from AGN 
to a dominant star-forming population at sub-mJy radio fluxes. 
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